Soft missing-feature mask generation for Robot Audition
نویسندگان
چکیده
منابع مشابه
Particle Filter Based Soft-mask Estimation for Missing Feature Reconstruction
In this work, we show how particle filter (PF) based speech feature enhancement can profitably be combined with soft-decision missing feature reconstruction. The combined approach is motivated by the fact that standard minimum mean square error noise compensation techniques fail to give accurate estimates of the clean speech spectrum if the noise spectral power significantly exceeds that of spe...
متن کاملSoft missing-feature mask generation for simultaneous speech recognition system in robots
This paper addresses automatic soft missing-feature mask (MFM) generation based on a leak energy estimation for a simultaneous speech recognition system. An MFM is used as a weight for probability calculation in a recognition process. In a previous work, a threshold-base-zero-or-one function was applied to decide if spectral parameter can be reliable or not for each frequency bin. The function ...
متن کاملPeriodicity and missing feature theory in audition 1.1 The aim
1.1 The aim 1 Harmonicity and spectral envelope Alain de Cheveigné (CNRS/ATR-HIP)
متن کاملEnvironment-independent mask estimation for missing-feature reconstruction
In this paper, we propose an effective mask-estimation method for missing-feature reconstruction in order to achieve robust speech recognition in unknown noise environments. In previous work, it was found that training a model for mask estimation on speech corrupted by white noise did not provide environment-independent recognition accuracy. In this paper we describe a training method based on ...
متن کاملSimultaneous Speech Recognition Based on Automatic Missing Feature Mask Generation by Integrating Sound Source Separation
Our goal is to realize a humanoid robot that has the capabilities of recognizing simultaneous speech. A humanoid robot under real-world environments usually hears a mixture of sounds, and thus three capabilities are essential for robot audition; sound source localization, separation, and recognition of separated sounds. In particular, an interface between sound source separation and speech reco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Paladyn, Journal of Behavioral Robotics
سال: 2010
ISSN: 2081-4836
DOI: 10.2478/s13230-010-0005-1